Seventh Workshop on Patent and Scientific Literature Translation
نویسندگان
چکیده
The invited talk concentrates on recent developments in WIPO Translate. We will notably highlight our experience in using Neural Machine Translation (NMT) in production on PATENTSCOPE since September 2016. We describe the training and decoding pipeline, a first quality evaluation experiment (comparing automatic metrics with publically available commercial tools and the impact on post edition) and will demonstrate the use of NMT on fast GPU servers (publically available in 10 languages from September 2017).
منابع مشابه
Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop
In aiming at research and development on machine translation, we produced a test collection for Japanese-English machine translation in the seventh NTCIR Workshop. This paper describes details of our test collection. From patent documents published in Japan and the United States, we extracted patent families as a parallel corpus. A patent family is a set of patent documents for the same or rela...
متن کاملOverview of the 3rd Workshop on Asian Translation
This paper presents the results of the shared tasks from the 3rd workshop on Asian translation (WAT2016) including J↔E, J↔C scientific paper translation subtasks, C↔J, K↔J, E↔J patent translation subtasks, I↔E newswire subtasks and H↔E, H↔J mixed domain subtasks. For the WAT2016, 15 institutions participated in the shared tasks. About 500 translation results have been submitted to the automatic...
متن کاملOverview of the 2nd Workshop on Asian Translation
This paper presents the results of the shared tasks from the 2nd workshop on Asian translation (WAT2015) including J↔E, J↔C scientific paper translation subtasks and C→J, K→J patent translation subtasks. For the WAT2015, 12 institutions participated in the shared tasks. About 500 translation results have been submitted to the automatic evaluation server, and selected submissions were manually e...
متن کاملOverview of the Patent Translation Task at the NTCIR-7 Workshop
To aid research and development in machine translation, we have produced a test collection for Japanese/English machine translation and performed the Patent Translation Task at the Seventh NTCIR Workshop. To obtain a parallel corpus, we extracted patent documents for the same or related inventions published in Japan and the United States. Our test collection includes approximately 2 000 000 sen...
متن کاملOverview of the 4th Workshop on Asian Translation
This paper presents the results of the shared tasks from the 4th workshop on Asian translation (WAT2017) including J↔E, J↔C scientific paper translation subtasks, C↔J, K↔J, E↔J patent translation subtasks, H↔E mixed domain subtasks, J↔E newswire subtasks and J↔E recipe subtasks. For the WAT2017, 12 institutions participated in the shared tasks. About 300 translation results have been submitted ...
متن کامل